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AESTBACT 

This paper reports on an investigation of th€ 
voice^onset-time {VOT) characteristics of word^initial stops produced 
by four four-year-old children- Instrumental analysis of the 
children's spontaneous speech showed that they haa distinct 
distributions for voiced and voiceless stops at all three places of 
articulation and that there was very little overlap fcettfeen voiced 
and voiceless cogna^js. Cosopariscn of words uttered in isolation with 
words uttered in a^entence context revealed that these children had 
shorter VOT values in runnina speech, Beanalysis of data from younger 
children showed evidence of such modification according to context at 
age 1:9. However, even at age four the data were not coffipietely 
adult^like: comparad with adults, the f our^year^olds had longer mean 
VOT values^ there was a greater range^- of values, and there was less 
modification according to sentence context. The f our ^year-oids* data 
were interpreted as further evidence for the position that in 
producing voiceless stops in terms of mean VOT values, children first 
overshoot adult values and then only gradually draw them back towards 
idult values, (Author) 
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ABSTHACT* This paper reports ou an ij'^ vestlgatlon the voice-^nsat-^tia^ 
lTS (VOT) charactarlstics of word-^lclal. stops produced by f^ur four-yaar^Isf child- 
OJ ren. Instrumental anmlyais of the children's spontanaoua 'i-pMch showed v.hac thay 
had distincc distributions for voiced md voicaless sto^s at all thrae n 



r— * artieulation and that thera was very little overlap between voicad vaicfciess 
fT> CQ^nates. Comparison of words uttered In isolatioa wlr.h words utCMet in a sen-- 
LiJ tence cor^i^a^^t raveled that these childr^ai had shorter VOT values i;^ Diimlng 

speech. Reanalysis of data from younger children showed $vi4enctr. of such modif-^ 
ication acicording to context at age 1^9* However, es^an tout ^he data were 

not completely adult-like: compared with adults , the fQnr-ycai^'HDldg had longer 
m^n TOT values j there was a greater range of values and chsra was l&^s modific- 
ation according to aentence content. The four-year-^lda " data were ijicerpreted 
as further evidence for the position that in producing voioeless stops in teras of 
mean TOT values children first overshoot adult values and then only gradiially draw 
th^ ^back towards adult values* 



1,0 Introduction 

^ In this paper we report data on the voice onset time (VOT) character- 

^ istics of word-initial stops to the speech of four-year-old American English- 
speaking children and compare them with similar data from two-year-olds and adults, 
We use VOX here as a phonetic measure to trace the acquisition of the voicing con- 
trast in Ebaglish stops* 



<) 



WhUe several phonetic parameters underlie the control of vo icings VOT has so 
far proved to be the most reliable acoustic measure* In accordance with Lisker ^ Abramsori 
1964 and subse^ent literature, VOT is defined as the ttae inter^^al betwem the burst of a 
stop and the onset of glottal pulsing- Its physiological basis can probably be found 



1, Thia resSrch ia part of nhm nC^:ivities of the Stanford Child Phonology 
Project* This is part of a larger study of the acquisition of the voicing con^ 
trast in English* Spanish and Cantonese, which has bera supported by a National 
Scimce Foundation Grant (BNS 7&-^0B9m} m Charles A. Ferguson and Dorothy A* 
Huntington, Departments of Linguistics and Heartag and Speech Sciences* We 
gratefully aclmowledge their support during all phases c£ the research. Tliio 
paper will aloo appear to Languaoc Speech 1980, 
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in craaporal control of the larynx in relation to supraglottal aetivity (Lisker & 

AbrAmsionDul97i). English has a voicini contrast at three pUces of articulation. 
With tha voiced scops /b,d,g/ the onset of glottal pulsing Is asually at or shortly 
aftfflr the burst (the "short lag" range), although some speakers tend to^^produce 
vrt if, ^^d stops with slo"al pulsing prior to the burst (the voicing "lead" range,, da- 
ftGtad by negative VOT values). The voiceless stops /p,t,Sc/ are produced with a 
.snger time interval batwaen the burst and glottal pulsing (the "long Ug" range). 
WT also varies by place of arciculation alveolar stops having longer VOT values 
than labial stops, and velars having longer values than alvwlara. 

Lisker & Abramson (1964) give figures for adult speakers of American 
English. Utterances of /b.d.g/ with lead are analysed separately from those in 
the short lag range. For words spoken in Isolation they give means of 1ms,, 5ms. 
and 21ms. for short lag /b.d.g/ respectively and -101ms., -102m. and -88ms. for 
/h d g/ '>?ith lead values. The means for the voicaless series ,p,t,k/ are 58ms., 
70ms' and 80ms. respectively. In Lisker & Abramson's isolation data there is no 
overlap in the range of values for voiced and voiceless stops at any of the three 
places of articulation. tJhen the wards are spoken in sentences, the VOT distinction 
is r^uced: the VOT values for voiceless stops are considerably shorter than wnan 
spoken in Isolation and the ranges for voiced and voiceless cognates overlap. 



In our longitudinal study o:: two-year-cld English-spaaking children we 
found evidence for three stages in the acquisition of the voicing contrast. In the 
first stage the children do not produce any distinction in their speech in terms of 
VOT In stops which are phonetically voiced and voiceless in the adult form: most 
tokens for both sets of stops fall li the range 0-30ma. and there are no differences 
in the mean values for voiced and voiceless phonmea. In the second stage the dis- 
tribution for voiced and volcelass phonmes at any one place of articulation are 
highly overlapping J most tokms for both stops are still in the short lag range but 
Che means for the voicaless stops are significantly longer than the means for the 
voiced stops. This is the first stage at which there Is evidence of a distinction 
between the stops in the children's productions. We have hypothesised that under 
normal liscenlng conditions adults are not aware of this first actempt at a dis- 
tinction In Che third stage the children at first overshoot the adult values, 
producing voiceless stops that are considerably longer than f.heir adult counterparts, 
and they then gradually draw them back towards the adult values. 

The study of four-year-olds presenced here provides a data point two years 
later when children have acquired productive control of mst of the phonological op- 
positions of Etaglish but are stUl refining phonecic detail. The queations to be 
asked of these data are 1) what evidence can be found for developmental changes be- 
tween age two and four that Indicate qualitative differences in speech production 
at the two ages and 2) to what extant have the four-year-olds mastered the phon- 
etic details of the adult VOT systm? 

In order to compare our data with the two-year-old data and with adult 
data we carried out two analyses. We first analysed a sample of spontaneous speech 
to get our overall view of the VOT characteristics of the four-year-olds stops. 
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However, it is difficult to compare this directly with adult data; a sample of spokon 
Speech contains a web of interwoven variables which comes under the general rubric of 
''speech context" and which affact VOT considerably in adult speech/ They include 
stress, speech rata, s>Titactic category, phonetic environment, sentence conte:xt and 
utterance length (Lisker & Abramison 1967; Disimoni 1974; Klatt 1975), Given the 
vagaries of child speech, it is difficult to categorise tokens in terms of many of 
these variables. However, Gome such breakdoxm is necessary for a direct comparison 
with adult data. Sentence context (i.e, at the simplest level the distinction be= 
tween whether a token is uttered in. isolation or in a sentence) is one variable which 
can be reliably assigned, and its effects on VOT In adult speech have been documented 
(Lisker & Abramson 1967). Further, of the variables mentioned above it appears to be 
the one which affects VOX values the most. 



Our second analysis therefore looked at tokens which had been uttered 
in isolation and compared them with tokens in sentences. The breakdown of the data 
into these two categories gives a more precise comparison with adult data and pro- 
vides another way of looking at the extent to which the f our-year-^olds ' data are 
adult-like. It is also important for our comparison with the younger children: 
since the proportion of isolated tokens changed over time during the longitudinal 
study, spontaneous speech of the younger children could not be compared directly 
to the spontaneous speech of the older children, 

2.0 Method 



The subjects for this study were four children, two girls and two boys, 
who will be referred to as Karen, Hilary, Dan and John. Karen and Hilary were 
flrst-bom and Dan and John second-born. They were all middle class monolingual 
speakers of Ajmerican English who had been exposed to no other languages in the home 
and whose English was developing normally. Their ages ranged from 3;10.19 to 4;2,9 
when they were first recorded 

The children were recorded in a high quality sound- isolated room at the 
Stanford Speech Research Laboratory. Each child was recorded for half an hour on 
two occasions within a period of two weeks. Recording was carried out on a Revox 
A77 tape recorder with a Sony Electrat microphone which was attached to a soft cloth 
vest which the child wore. The recording room was decorated as a playroom. The 
child sat with an experimenter and talked while playing with toys and looking at 
books. An observer sat in the control room taking notes on the sessions and oper- 
ating the tape recorder. 

The tapes of the samples of spontaneous speech obtained from these ses- 
sions were transcribed using a Revox A77 tape recorder and Superex St-pro B-V head- 
phones. Words beginning with a singleton stop were transcribed phonetically and 
also categorised in terms of the sentence context in which they were uttered. They 
were first divided into two categories on the basis of whecher or not the words 
were uttered in isolation. The non-isolated tokens will be referred to as the sen= 
tence data (although they in fact include a few instances of counting numbers and 
listing objects which are not true sentence f rames) » The sentence tokens were bro- 
ken down into four further categories^ sentence initial , sentence final, embedded 
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In a sentanca and ambedlded in a list. 



From the tapes of aach child we Inscrumantally analysad the first fiftean 
word-initial tokens of each of the six stop consonaiit^ of English. CFurther tokens 
were later analysad for the comparisoh of contexts,) The VOT of the word-^init lal 
stops was measured usiiig a Kay Sonograph 7029 MC linked to a TektroniK Oscillo- 
scope 564 by means of a locallv dasigned and constructed triggering system. Fur-- 
ther details of the procedures which were comon to all our studies can be found 
In Huntington et al 1978* Frequency distributions were calculated for the data 
for each stop for each child; means ^ standard deviations and ranges were calculated 
and the differences between the Mans for voiced and s/ciiteless cognates were co^-- 
pared using one-tailed t-tests. to the later analyst.^ »f the voiceless stops by 
context were also compared using one^tailad t-test&. 



3.0 Resi- 



The overall results for the first fifteen tokens of each of the six 
stops are given in Table One. Production distributions are given separately for 
each of the four children; since their results were broadly similar to each other, 
their combined results are also given , 



The data in this analysis represent a sample of the spontaneous speech 
of these children. All four children had distinct distributions for voiced and 
voiceless stops at all three places of arc.iculation. There was a small amount of 
overlap in the ranges for voiced and voiceless tokens; individually^ three of the 
children showed this at all three places of articulation, while the fourth, Karen, 
exhibited no overlap. In the combined results the Interplace relationship found 
in adults was followed, the alveolar stops being longer than the labials, and the 
velars being longer than the alveolars,. although none of the Individual children 
followed this pattern in all respects* There were five instances of adult-'like 
lead values and a furthH^r three tokens where there was sligh!: lead, 0 to -10ms*, 
which should probably bfe classified with short lag rather than with lead* (Such 
"short lead" does not occur In adults.) Six of the eight lead tokens were for /b/ . 



The second analysis was in terms of the speech context in which the words 
were uttered. Dealing with each of the voiceless stops /p,t,k/, for each child we 
compared "isolated" tokens and "sentence" tokens, instrumentally analysing addltiona 
words from the recordings where necessary. We aimed to obtain fifteen tokens in eacl 
category for each^ chlld$ this was possible with the sentence data but there were not 
fifteen isolated tokens for each stop in the recordings. 



The res'ilts of this anaiysis are given in Table Two, Although there is 
considerabie variability in the individual children's results, the general pattern 
is that, as expected3the sentences tokens have shorter VOT values than do the isolated 
In the- combined comparisons the means for the sentence data are shorter than those 
for the isolat^id data at all three places of articulation, although none of the dif- 
ferences between the means are statistically significant. In adults the actual pos- 
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1 



Child 




HILARY 


DAN 


JOM 


COMBINED 


Age 




4;0.15 


3;10.19 


4;0.28 




/b/ iffiAN 
H 
SD 
RANGE 


3.80 
15 
4,97 
-4/lS 


7.47 
15 

5.96 
0/18 


4.07 
15 
47.61 
-117/54 


-1.53 
15 
33.27 
-90/21 


3.45 
60 
28.73 


/p/ .^AN 
N 
SD 

RANGI 


76.60 

15 
32.71 
41/143 


82.00 
15 
43.33 
3/174 


76.27 
15 
36.67 
25/162 


63.40 
15 
32.25 
21/106 


74,57 
60 

36,23 

J/ 1 


/d/ MEAN 
K 
SD 

RANGE 


11.80 
15 
8.85 
-5/31 


14.07 
15 
11.20 
0/41 


22.47 
15 
12,49 
6/50 


20.33 
15 
13.' o 
3/i 


17.17 
60 
12.11 


/t/ MEAN 
N 
SD 
RANGE 


1 

87.13 
IS 
56,99 
34/236 


64.47 
15 
49.22 
4/163 


74.93 
15 
32,11 


88.67 
15 
31.70 
'♦1/150 


78.80 
60 
43.90 
4/236 


/g/ tEAN 
N 
SD 
RANGE 


10.73 
15 
6.92 
0/23 


20.13 
15 
8.89 
0/39 


31.40 
15 
13.27 
15/58 


17.73 
15 
37.77 
-113/45 


20.00 
60 
21.60 

-113/58 


/fe/ MEAN 
N 
SD 
RANGE 


82.80 
15 
24.69 
48/125 


87.87 
15 
53.60 
27/204 


90.40 
15 
49.37 
40/224 


82.40 
15 
21.74 
35/126 


85.87 
60 
39.10 
27/224 



Tsbla 1, English fmir-year*aldsi man VOT values In nllllaeconds, 
number of tekass, atandard devlaelans and range for earh stop, (All 
differenees In m^an values between voised and ^oleelais cognates ara 
significant at p<.001.) 







KAREN 


HILARY 


DAN 


JOHN 


COIfflrNED 


/p/ 


ISOLATED 
SEMTmCE 
EHBEDDED 


68.00 (6) 
71,00 (15) 
68.09 (11) 


107.55 (11) 
78.00 (15) 
63.20 (5) 


81.77 (13) 
65.47 (15) 
56.36*(11) 


62.75 
61.87 
67.64 


(8) 

(15) 

(11) 


83.05 (38) 
69.08 (60) 
63.92*(38) 


/t/ 


ISOLATED 
SENTENCE 


100.00 (2) 
78.73 (15) 
57.82*(11) 


77.00 (6) 
50.80 (15) 
36.50*(10) 


80.43 (7) 
64.60 (15) 
67.80 (10) 


79.30 
81.93 
66.56 


(10) 
(15) 
(10) 


80.72 (25) 
69.02 (60) 
54. 40* (41) 


/k/ 


ISOLATED 
SEfraiCl 
EHBEDffiD 


86.43 (7) 
80.53 (15) 
73.10 (10) 


138,67 (6) 
96.60 (IS) 
60.30*(10) 


75.20 (10) 
85.80 (15) 
83.50 (10) 


90.80 
76.00 
75.40 


(10) 
(15) 
(10) 


90.97 (33) 
87.47 (60) 
73.08*(40) 



Table 2, liigllsh four-year -olds i man VOT values 1q ollllseconda 
and number of tokraa for voleeUas stops , broken doim by speech 
aonCexes« 

* al^ificant difference between Mans for Isolated and ei^edded 
tokens (p<,OS), 
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ition of a word in a sentence can affact the VOT (e.g, prepausal lengthening in 
sentence final position, laatt, 1975). To imke a finer analysis, although with 
less data, we therefore dealt solely with tokans embedded in a sentence (i.e. ex-- 
eluding those in initial position^ in final position and in lists) and compared 
these with the isolated tokens* 

These data are given in Table Tm. In general, the means for the embedded 
tokens were lower than the overall sentence figures. When compared with the isolated 
tokens the means for the embedded tokens were significantly lower at all three places 
of arclculation. This was also true for some of the Individual children's results- 

4*0 Comparison with adults 

The four-year-olds data is similar to that reported for adults. (We have 
used Lisker & Abramson's 1964 data for comparison. The various studies of adult 
speech give slightly different values | inspection of these other studies (e.g, Klatt, 
1975, Hoslin, 1978), however, shows Lisker & Abramson's orlgtoal values to be rep- 
resentative*) In both adults and four-year-olds there were clear VOT distinctions 
between voiced and voiceless tokens at each place of articulation. The means for 
velar stops were longer than those for alveolars, which in turn were longer than 
those for labials. Overall in the sentence data there was some overlap between 
voiced and voiceless distributions at each place of articulation. However, the 
four-^year-olds data were not completely adult-like: the mean values were longer 
than those for adults, they did not reduce as much in different sentence contexts, 
and they exhibited a greater range. 



Lisker & Abramson 1967 have VOT data for Isolated tokens and for sentence 
tokens which can be compared with the f our--year^old data here* These are given in 
Table Three. Across all three places of articulation the children "S mean values are 
consistently longer than the adults* This is true both for isolated tokms and 
for sentence tokens. The ratio of the sentmce to the isolated means (S/1) shows 
the amount that VOT values are modified in the two conterts* These ratios for adults 
and four-year-olds at each place of articulation are also given in Table Three. We 
can see from the table that when the tokens are in sentences ''01 values are reduced 
far more by the adults than by the children. 



The children also exhibited a greater range of VOT values than the adults 
In Lisker ^ Abramson 's study. With far more tokens the adult isolated values for 
/p,t,k/ ranged over 100ms., 75ms, and 85ms., repectlvely, while the children's 
isolated values for the same stops ranged over 228ms,, 98ms* and 262ms*, respec^ 
tlvely. There Is one difference between the two studies In the manner of data 
collection which could possibly account for this variation. The adults were read- 
ing sentences and presumably aiming for consistency (see Lisker & Abramson 1967, 
pl8), whereas the child data were taken from naturalistic speech which probably 
included a greater range of expressiveness and hence variability* As noted before. 
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FOUR-lfEAR 


ADULTS 






OLDS 




hi 


ISOUTED 


83.05 


58.00 




SENTENCE 


69.08 


33.00 




RATIO S/I 


.83 


.57 


Itl 


ISOUTID 


80,72 


70.00 




SENTaiCE 


69,02 


38.00 




RATIO s/r 


.86 


.54 


/k/ 


ISOLATED 


90.97 


80.00 




SENTENCE 


87.47 


48.00 




RATIO S/I 


.96 


.60 



_ _ _ _ - - — ^ 

Tablt 3* Ingllsh four^year^oldst oiean VOT values in ffillliseeonds 
far isolated and i«iiteiiae data aei^ared with adult data fro« Liakar 
and Abraumi 1964« P«394i 1967, p«10. 



there were very few instances of adult-like voicing lead in the four-year^Qlds' 
data. This suggests that four-year-olds use prevoieed stops (i*e* with lead VOT 
values) less often than adults i however ^ we cannot be certain from the results of 
just four childran, since not all adults use lead. 

The picture that emerges from the comparison of these children with 
adults is that the four'-year-old childran have consistently longer VOT values for 
voiceless stops ^ that they do modify their VOT values In the two different con-- 
texts J but that they do not modify them to as great an eKtent as adults do* They 
also appear to exhibit more variability then adults* 

5.0 Comparison with two-*year-olds 

It is difficult to compare these data on four=year-olds with the results 
of our longitudinal study of two-year-olds* Each of the four younger children was 
at a different stage of acquisition at the end of the study* One chUd did not 
have a claar contrast batween voiced and voiceless phonanas and was still produce 
ing inost tokens of both voiced and voiceless ptonemes with short lag voicing. The 
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other three had overshot adult values for voiceless tokens; one child was certain- 
ly beginning to draw her VOT values back toward adult values, and another, although 
exhibiting greater variability, appeared to be close to adult values by the last 
session- 



Looking at the overall data, not broken down by sentence context , if we 
compare the four-year-olds' data with the data from the last sessions of the three 
two-year-olds who had a clear voicing contrast (Macken & Barton 1977), there were 
no obvious differences in mean VOX values, nor In variability expressed in terms 
of either standard deviation or ranges* There was, however, less inter-^subject 
variaiton In the four-year-olds. 

Since the four-year-olds modified their VOT values in different sen^ 
tence conteKts, we re-examined the two-year-old data to look for similar effects. 
The two-year-olds spoke less than the f our-year--olds, so there is less data and 
less certainty with the comparison, but it is possible to compare the isolated 
and the sentence tokens on the last sessions of the three two-year-olds who had a 
clear voicing contrast (ages 2; 0.10 to 2; 1.17). The means for the ovarall data 
and the breakdown into Isolated and sentence tokens are given In Table Four. (Be- 
cause of the differences between the children we have not combined the data here,) 
In all of the eight comparisons between Isolated and sOT^tence data, the means are 
lower for the sentence data* It is clear from these figures that these two-year- 
old children were also modifying their VOT values in the different contexts* 
Tlius, like the four-year-olds, their speech was sensitive to the context in which 
It was uttered. Since the data are spread thinly throughQUt several categories, 
we are hesitant to give a precise figure, but the data certainly suggest that the 
two-year-old children modified their speech In the different contexts to a similar 
extent as the four^year-=olds# 



Given that the two-year-old children modified their speech according 
to context on their last sessions, there are two reasons for wanting to look back 
through the earlier sessions of the longitudinal study* Firstly, we want to be 
sure that this systematic Influence does not affect our earlier claims of the child-- 
ren overshooting the adult VOT values and then gradually drawing their values back. 
It appears that although the proportion of Isolated tokens changed over timm this 
did not affect our findings* To take the emmple of one of the two-year--olds , 
Tom, in the five samples of his language that we analysed between the ages 1;6*24 
and 2*1.17, the percentage of iMlateH tok^s In the samples want down consistently 
from 100% to 9%* If we had solely analysed isolated utterances throughout, the 
mean VOT values would have been longer towards the end of the study* Nevertheless, 
both isolated and embedded tokens followed the same pattern of increasing and then 
decreasing. 

The second reason for looking back at the earlier sessions of these 
children Is to find out if they had always been sensitive to conteKt, even when 
they first produced words in sentences. For all three children modification took 
place earlier than these last sessions. We traced evidence of modification back 
to ages 1;11*8, 1|9*2 and 1|9*15 in Tom, Tessa and Jane respectively* Earlier 
than this, the situation Is less clear-cut as there were fewer tokens In sentences. 
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CHILD 
AGE 


TOM 
2;l,n 


TES^A 
2; 0,10 


JANE 

2;1.16 


OVERALL 

ISOLATED 

SENTENCE 


32.33(15) 
82,50(2) 
24.61 (13) 


?3.^!0(L5) 
97.SS(li) 
83.25(4) 


71.39(13) 

72.28(7) 

70.33(6) 


OVERALL 

ISOLATED 

SENTENffi 


61.93(14) 
98.50(2) 
55.91 (12) 


100.;JO(15) 
117.113(6) 
88. M (9) 


109.23 a3) 
176.00(2) 
97.09 (11) 


OVERALL 

ISOLATED 

SENTENCE 


91.60(15) 

- (0) 
91.60(15) 


125 .,93(15) 
145 , 2-! (7) 
109,7. '8) 


124.62(13) 
127.14 (7) 
121.66(6) 



Table 4, Eogllsh two-yaar-olds ■ mean VO] values In milllseaonds 
and ntimber of tokens for volcelei^s steps werall end broken down 
by speech contexts* 



Wa would need more data to evaluate whether the children modified their speech ac- 
cording to GOntext in this manner when trhey irst produced two word sentences. 



6.0 Comparison with other studies 



Two other studies of English-speaking children conflict with each other 
in the mean VOT values they report for children around this age. Zlatln 6 Koenlg- 
sknecht 1976 report data for two-year--old childrc.n (ages 2;6 to 3;0) and six-year- 
olds; for both age groups tokens were Bpn'«n in isolation and means for /pjt/ 
were approximately 70ms and the mean %cl /k/ approximately 80ms, Gilbert 1977, 
studying apical stops from the spontan<iou& speech of children aged 2; 7 to 3;3i 
gives an overall mean of 140ms • '^r /t/, lur means for isolated tokens agree 
with Zlatin & Koenlgsknecht ' s al chough our values are approxdinately ICfms* longer 
than theirs, (toe difference l^t^een the two studies is that Zlatin &. Koenlgs-- 
knecht's values for each place c.; articulation were obtained from repetitions of 
only one or t\so words,) The Mans reported by Gilbert are much longer than either 
of these two studies; the diti^irenceE we have found betwem different contexts are 
not great enough to account r such long VOT values* nor are there obvious differ^ 
encea in methodology to account for f hem. Since Gilbert's mean value covers six 
children. It is unlikely that the di ' f erence can be attributed to individual var- 
iation* We can offer no way of race veiling the different figures. 
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Our results with English-^speaking children are comparable to our findings 
on four-^year-old Cantonese-speaking children (Clumeck et al 1979). Like English 
Cantonese has a VOX distinction between short lag and long lag and f our-^yaar-old 
Cantonese speakers had a clear voicing distinction in terms of VOT. However, both 
the English and the Cantonese results are very different from our findings with 
speakers of Spanish (>kcken and Bartou 1979), At four years the Spanish speakers 
we studied did not have a clear voicing distinction in terms of VOX, There ara 
two important differences between English and Spanish in their voicing contrasts. 
Firstly, in terms of VOX, the voicing distinction in Spanish is a contrast between 
lead and short lag. Secondly ^ the voiced phoneraes have commonly occuring alio- 
phones which affect word-initial stops. The influence of these two factors on the 
acquisition of the voicing contrast in Spanish is dircusred in detail in ^^acken 
& Barton 1979* 



In general our English results agree with those of other investigators 
(Kewley^Port & Preston 1974; Zlattn & Koenigsknecht 1976) that once a clear voicing 
contrast is acquired there appears to be little change between then and four years. 
However, the interpretation Zlatin & Koenigskjiecht put on their data Is that ac^ 
quisition of the voicing contrast can be characterised as a consistent movament 
towards adult values; this does not fit in with the overall pattern observed in 
our studies. There was not a gradual extension of the VOT values of voiceless stops 
but rather the children first overshot the adult values and then only gradually drew 
them back towards the adult onas. Even at age four^ our subjects produced voiceless 
stops with VOT values that were longer than their adult counterparts. 



There have been relatively few instrumental studies of other phonetic 
parameters once children have acquired most of their phonology. Nevertheless^ 
studies of duration of consonant clusters (Gilbert & Purves 1977; Menyuk S Klatt 
1975; Hawkins 1978) and other segments (DiSimoni 1974; Smith 1978) have also found 
children's productions at this age to have greater duration than adults. It has 
been proposed that in tems of duration children's productions do not become adult- 
like until age seven (Gilbert & Purves 1977) or older (DiSimoni 1974). Our data 
support the postition that once children have acquired productive control of the 
phonological oppositions of their language they continue to refine phonetic de- 
tails for several years. Our analysis of younger children "S data nevertheless 
demonstrates that at least as early as 1;9 there is evidence of children modi-- 
fying duration according to the speech context in which a word is uttered. 
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